TITLE OF THE INVENTION 

STORAGE SYSTEM 

BACKGROUND OF THE INVENTION 
5 The present invention relates to a storage system 

provided with a cache memory and a control method of the 
cache memory. 

As a technique for improving a performance of a 
storage system having plural storage units (hereinafter 
10 referred to as just "storage system" ) f there has been well- 
known a technique of introducing a volatile semiconductor 
memory section (hereinafter referred to as "cache memory") 
to the storage system. 

With respect to a data write request, a storage system 
15 having a cache memory returns a response of completing the 
write to a computer that requires the data write at the 
time when the data is written in the cache memory 
(hereinafter referred to as just "computer" or "host 
computer" ) , while asynchronously writes data to a storage 
20 unit. Since the speed of writing data to the cache memory 
is faster than the storage unit (a disk drive or the like 
here) , the storage system can return the response to the 
host computer with higher speed. 

However, latest data is present only in the cache 
25 memory until the data is written in the storage unit, so 



that the storage system is required to enhance reliability 
of the cache memory. 

A known method as a technique to improve the 
reliability of the cache memory is the one for establishing 
5 a redundant configuration of the cache memory. Methods for 
establishing the redundant configuration include storing a 
copy of data in plural cache memories (mirroring) or a 
cache memory having a RAID construction disclosed in 
Japanese laid-open publication number Hei 9-265435. 

10 Further, there has been known a control method ("write 

through control") for surely storing data in the storage 
unit with respect to each write request in order to 
maintain the reliability of the storage system also in a 
case where the redundancy of the cache memory is lost due 

15 to a failure or the like of the cache memory. However, the 
reliability is maintained by the write through control, but 
the abovementioned advantage of the cache memory is lost, 
with the result that, even if the cache memory is possessed, 
the response speed to the write request becomes 

20 approximately equal to the case where the cache memory is 
not possessed. 

In view of this, techniques have been devised for 
increasing the redundancy of the cache memory so as not to 
require the write through control. Such techniques include, 

25 for example, providing a spare cache memory or providing 
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three or more cache memories for which the remaining write 
data that is to be written in the region covered by the 
cache memory having the failure occurring is shared by the 
remaining cache memories as disclosed in Japanese laid-open 
5 publication number 2001-344154. 

A demand for configuring such a storage system in 
further a large scale has currently been risen. However, a 
conventional technique unifiedly utilizes a cache memory. 
Therefore, as the configuration scale of the storage system 

10 becomes large, an access is concentrated on the cache 
memory or information for managing the cache memory, 
thereby entailing a problem that a throughput performance 
of the storage system is difficult to be maintained only by 
having the cache memory. 

15 Further, the problem same as the abovementioned one is 

applied to the case of maintaining the reliability of the 
storage system and write performance in a case where the 
failure occurs in the cache memory. Specifically, the 
technique disclosed in the aforesaid Japanese laid-open 

20 publication number 2001-344154 unifiedly utilizes the cache 
memory, so that as the configuration scale of the storage 
system becomes large, an access is concentrated on the 
cache memory or information for managing the cache memory 
upon the occurrence of the failure in the cache memory, 

25 thereby entailing a problem that a throughput performance 



of the storage system is difficult to be maintained only by 
having the cache memory, and consequently, it is difficult 
to provide both the enlargement of the configuration scale 
and the reliability upon the occurrence of the failure. 

SUMMARY OF THE INVENTION 
An object of the present invention is to provide a 
storage system capable of having a large-scale 
configuration for maintaining a write access response speed 
and reliability upon the occurrence of a failed cache and 
its control method. 

In order to attain the abovementioned object, the 
present invention comprises the following structure. It is 
a storage system having plural control units and storage 
units. Further, each of the plural control units has a 
memory, for example, a cache memory. In the storage system 
having this structure, a first control unit among the 
plural control units, when receiving data from a computer 
connected to the storage system, stores the received data 
into the memory possessed by the first control unit and the 
memory possessed by the other control unit (hereinafter 
referred to as "a second control unit"), and then, 
transfers the data to the storage unit. 

Moreover, in a case where a failure occurs in the 
second control unit in the abovementioned configuration, a 



copy of the data received from the computer is newly stored 
in the memory of a third control unit. 

Additionally, a configuration may be applied wherein 
the second control unit stores the data received from the 
computer in the memory possessed by the first control unit 
and the memory possessed by the second control unit. 

Further, in a case where the failure occurs in the 
second control unit, a configuration may be applied wherein 
the first control unit which is designated as a pair 
substitutionally performs a processing of the second 
control unit. In this case, the first control unit stores 
a copy of data received from the computer during the 
substitutional processing of the second processing unit in 
a memory possessed by the other control unit that is not 
the pair. 

Moreover, a configuration may be applied wherein the 
first control unit and the second control unit that form a 
pair respectively is supplied electric power from a 
separate power supply. 

Further, a configuration may be applied wherein the 
plural control units are connected to one another via a 
switch. 

Additionally, a configuration can also be applied 
wherein each control unit is connected to computer via an 
interface unit . 
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Moreover, a configuration can also be applied wherein 
the storage system has a management unit that has 
information showing a corresponding relationship between 
the plural control units and the storage units, each 
5 control unit being operated based upon this information. 
Further, a configuration can also be applied wherein the 
storage system does not have a management unit but each 
control unit has the abovementioned information. 

Additionally, the control units that are in pairs is 
10 configured to be connected to the same storage unit. 

Further, even in the configuration wherein the 
management unit finds the failure in the control unit, a 
configuration can also be applied wherein the other control 
unit or interface unit detects the failure in the control 
15 unit. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a view schematically showing a storage 
system of a first embodiment according to the present 
20 invention ; 

Fig. 2 is a view showing a configuration example of a 
cache adapter 3; 

Fig. 3 is a view representing management tables 11 and 

31; 

25 Fig. 4 is a flowchart showing a process flow of a read 
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requesti- 
ng. 5 is a flowchart showing a process flow of a 

write requesti- 
ng. 6 is a view showing an arrangement of a region 
5 allocation of a cache memory in a normal state; 

Fig. 7 is a view showing an arrangement of a region 

allocation of the cache memory in a case where a failure 

occurs in a CA2; 

Fig. 8 is a view showing an arrangement of a region 
10 allocation of the cache memory in a case where a failure 

occurs in the CA2 ; 

Figs. 9(A) and 9(B) are views showing a master 

management table 71; 

Fig. 10 is a flowchart including a processing in a 
15 manager adapter 7 when a failure occurs in the CA2; 

Fig. 11 is a view showing a comparison in size of a 

write cache region; 

Fig. 12 is a flowchart showing a processing of the 

other cache adapter in a case where a failure occurs in the 
20 CA2; 

Fig. 13 is a flowchart showing a processing when the 
CA2 is recovered from the failure occurring therein; 

Fig. 14 is a flowchart showing a processing when a 
failure occurs in a cache adapter pair and it is recovered 
25 from the failure; and 
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Fig. 15 is a flowchart showing a processing performed 
not through the manager adapter 7 in a case where a failure 
occurs in the CA2 . 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
An embodiment of the present invention will be 
explained hereinbelow with reference to drawings. It is 
needless to say that the present invention is not limited 
to the embodiments described hereinbelow. 

Fig. 1 is a view showing a first embodiment of a 
storage system to which the present invention is applied. 
The storage system has a disk controller 5 and plural disk 
drives 4 . It is to be noted that the disk drive 4 means a 
storage unit having a non-volatile storage medium such as a 
hard disk drive, CD, DVD or the like. The disk controller 
5 is connected to a host computer 6 via a communication 
line (hereinafter referred to as "channel") 61. Further, 
the disk controller 5 and the disk drives 4 are connected 
to each other via a communication line (hereinafter 
referred to as "disk channel") 41. The host computer 6 
sends or receives data to or from the disk drives 4 via the 
channel 61, the disk controller 5 and the disk channel 41. 

A protocol such as, for example, SCSI (Small Computer 
System Interface), Fibre Channel, or the like is used at 
the channel 61 and disk channel 41. Moreover, the channel 



\ 




10 



network relating to receiving and sending information for 
the system management . 

The host adapter 1 is an interface device that 
receives an access request from the host computer 6 via the 
channel 1, performs an analysis of the access request based 
upon a management table 11 provided in the host adapter 1, 
communicates with a suitable cache adapter 3 via the 
internal link 21 and returns a response to the host 
computer 6 . 

The cache adapter 3 is connected to the disk drive 4 
via the disk channel 41 so as to communicate with the host 
adapter 1 or the other cache adapter 3 via the internal 
switch 2. Further, the cache adapter 3 controls read of 
data from the disk drive 4 or write of data to the disk 
drive 4 based upon the communication from the host adapter 
1. Moreover, the cache adapter 3 is a controller for 
controlling a cache memory 32 possessed by the cache 
adapter 3 to perform a storage of data to the cache memory 
32 . 

It is to be noted that the cache adapter 3 basically 
stores in the cache memory 32 only data relating to the 
read of data or write of data stored in the disk drive 4 
connected to itself. In other words, data of the disk 
drive 4 managed by the other cache adapter 3 is not stored 
as generally used data in the cache memory 32 of the other 



cache adapter 3 . 

The cache adapter 3 also performs a control of a 
redundancy with respect to the disk drive 4 (for example, 
making each RAID level redundant). Further, an another 
port of each disk drive 4 connected to a certain cache 
adapter 3 via one port is connected to another cache 
adapter 3 that is in pairs with this cache adapter 3 for 
making the cache adapter itself redundant. 

Although the present embodiment explains about the 
configuration wherein plural pairs of cache adapters 3 are 
stored in one disk controller 5, other configuration may be 
applied wherein a single device is composed of one pair and 
a disk drive 4 that is common to this pair, such device 
being connected to one another via the switch . In this 
case, a device for management (manager adapter) manages 
each pair via the switch . 

The manager adapter 7 is provided with a master 
management table 71 having registered therein information 
about the configuration of the storage system. The manager 
adapter 7 changes the content of the master management 
table 71 and transmits necessary information to the host 
adapter 1 or the cache adapter 3 in a case where the 
configuration or the like of the storage system is changed. 

The power supply units A511 and B512 are respectively 
connected to an external power supply (not shown) such as a 
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commercial power supply for supplying electric power to the 
storage system. It is desirable that the power supply 
units A511 and B512 are respectively connected to an 
external power supply of a different system in order to 
take precautions against an accident of the power supply. 
Moreover, electric power is supplied to a cache adapter 3 
that pairs with a certain cache adapter 3 from a separate 
power supply units A511 and B512 in order to ensure a 
redundancy of the cache adapter 3 . 

It is to be noted that other device configurations 
include the one wherein the host adapter 1 is not present 
and each cache adapter 3 is connected to one another by the 
switch provided with the management table 11 that is 
possessed by the host adapter 1. In this case, the switch 
is connected to plural channels 61. Further, the switch 
has the management table 11 every channel 61, so that it 
transfers an access request of the host computer 6 to the 
cache adapter 3 based upon the respective management tables 
11. 

Additionally, the cache adapter 3 performs the 
communication with the host computer 6, protocol conversion 
or the like that is executed by the host adapter 1 in a 
case where the switch is utilized. 

Fig. 2 is a view representing a configuration example 
of the cache adapter 3. The cache adapter 3 has a cache 
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memory 32, an internal link interface (hereinafter referred 
to as "I/F" ) unit 33 connected to the internal link 21 , a 
disk channel I/F unit 34 connected to the disk channel 41, 
a processor 37, a control memory 36 and a peripheral 
control unit 35. 

The cache memory 32, internal link I/F unit 33 and the 
disk channel I/F unit 34 are connected to one another with 
a cache data bus 38. The internal link I/F unit 34 and the 
disk channel I/F unit 34 can perform a Direct Memory Access 
(DMA) transfer between devices. Specifically, the internal 
link I/F unit 33 stores data received from the host 
computer 6 via the internal link 21 in the cache memory 32 
via the cache data bus 38. When receiving the read request 
from the host computer 6, the internal link I/F unit 33 
takes out the data stored in the cache memory 32 via the 
cache data bus 38, and then, transfers it to the host 
adapter 1 through the internal link 21. 

The disk channel I/F unit 34 takes out data stored in 
the cache memory 32 via the cache data bus 38 and stores 
the same in the disk drive 4 via the disk channel 41 
(hereinafter referred to as "destaging"). Further, the 
disk channel I/F unit 34 takes out data stored in the disk 
drive 4 via the disk channel 41 and stores the same in the 
cache memory 32 via the cache data bus 38 (hereinafter 
referred to as "staging"). 
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The internal link I/F unit 33 and the disk channel I/F 
unit 34 execute a processing such as the abovementioned 
staging and destaging based upon the control of the 
processor 37 via the control data bus 39. 
5 The processor 37 is connected to the control memory 36 

and the control data bus 39 via the peripheral control unit 
35 including a memory control circuit or a bus control 
circuit. Stored in the control memory 36 are the 
management table 31, control program 361 and directory 

10 information 362. 

Registered in the management table 31 is information 
representing a corresponding relationship among a logical 
device (hereinafter referred to as "LDEV") designated from 
the host adapter l f a virtual device (hereinafter referred 

15 to as "VDEV" ) in a case where plural disk drives 4 are 
virtually managed as one device and the cache adapter 3 
(hereinafter referred to as "backup cache adapter") that 
makes the data stored in the LDEV redundant (copies here) 
to store it. 

20 The control program 361 is a program executed at the 

processor 37 when the processor 37 executes a control of 
each component possessed by the cache adapter 3. The 
directory information 362 is information representing a 
storage sate of data in the cache memory 32 such as the 

25 existence of the data to be accessed in the cache memory 32, 

I 
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an address in the cache memory or the like. 

Fig. 3 is a view showing content examples of the 
management table 11 possessed by the host adapter 1 and the 
management table 31 possessed by the cache adapter 3. The 
5 management table 11 has plural entries, each of which has 
fields 111 and 112. The field 111 has registered therein a 
logical unit number (LU number) designated by the host 
computer 6 upon the access. 

The filed 112 has subfields 1121, 1122 and 1123 in 

10 which information relating to the cache adapter 3 is stored. 
Registered in the subfield 1121 is a logical device number 
(LDEV number) corresponding to the LU registered in the 
field 111 and managed by the cache adapter 3. Registered 
in the subfield 1122 is information showing the cache 

15 adapter 3 that executes a staging or destaging, i.e., the 
cache adapter (hereinafter referred to as "master cache 
adapter") that executes the normal data write/read to/from 
the disk drive 4. Registered in the subfield 1123 is 
information showing the backup cache adapter that makes 

20 redundant the data stored in the cache memory 32 of the 
master cache adapter registered in the subfield 1122. 

In the management table 31, mapping information 
showing a relationship among the LDEV, the VDEV and the 
backup cache adapter as described above . The management 

25 table 31 also has plural entries, each of which has fields 
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311, 312, 313 and 314. Registered in the field 311 is 
information showing the LDEV number of the LDEV 
corresponding to one entry. Registered in the filed 312 is 
information showing the backup cache adapter that makes 
redundant the data of the LDEV registered in the field 311. 

in the field 313, information showing a VDEV number 
corresponding to the LDEV registered in the field 311 is 
registered. Registered in the field 314 is information 
showing the virtual device address (hereinafter referred to 
as "VDEV address") representing which section in the 
corresponding VDEV is allocated to the LDEV registered in 
the field 311. It is to be noted that the VDEV is 
designated by an administrator of the storage system 
through a console connected to an SVP (not shown) or the 
manager adapter 7 or by transmitting a special command 

through a channel 61. 

If the backup cache adapter registered in the field 
312 is the cache adapter 3 having its management table 31. 
this cache adapter 3 performs a processing for making the 
write data redundant as the backup cache adapter with 
respect to the LDEV designated by the LDEV number 
registered in the corresponding field 311. Specifically, 
the backup cache adapter receives the write data that is to 
be made redundant from the host adapter 1 or the master 
cache adapter 3. and stores the data in the cache memory 32 
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Fig. 4 is a flowchart showing a procedure at each 
adapter in a case where the storage system receives the 
read request from the host computer 6. 

Firstly, the host adapter 1 receives the read request 
from the host computer 6 via the channel 61. The host 
adapter is described as HA and the master cache adapter is 
described as CA(m) hereinbelow (step 2001). 

The HA1 receiving the read request searches the LDEV 
number corresponding to the LU number designated by the 
read request and information of CA(m) from the management 
table 11 (step 2002). Thereafter, the HA1 transmits the 
internal read request to the searched CA(m) via the 
internal link 21. The "internal read (write) request" is 
here a message of data read (data write) received and sent 
between the host adapter 1 and the cache adapter 3 (step 
2003) . 

The CA(m) receiving the internal read request 
determines whether data corresponding to the read request 
is present or not in the cache memory 32 by the directory 
information 362 based upon the address, size or the like 
included in the internal read request (hereinafter referred 
to as "cache hit determine") (step 2004). In a case where 
the corresponding data is not present in the cache memory 
32 as a result of the determine (hereinafter referred to as 
"cache miss"), the CA(m) stages the corresponding data from 
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the disk drive 4 to store it in the cache memory 32, 
thereby renewing the corresponding directory information 
362 (step 2005) . 

After the processing at the step 2005 or in a case 
5 where the step 2004 determines that the corresponding data 
is present in the cache memory 32, the CA(m) reads the 
corresponding data from the cache memory 32 via the 
internal link interface unit 33 to thereby transfer it to 
the HA1 that has sent the internal read request (step 2006). 

10 The HA1 receiving the data responds the received data 

to the host computer 6 (step 2007). 

Fig. 5 is a flowchart showing a process flow when the 
storage system receives a write request of data from the 
host computer 6. The backup cache adapter is described as 

15 a CA(b) hereinbelow. 

The HA1 receiving the write request from the host 
computer 6 via the channel 61 searches the LDEV number 
corresponding to the LU number included in the write 
request and the information of the CA(m) and CA(b) from the 

20 management table 11. 

Thereafter, the HA1 transmits the internal write 
request to the searched CA(m) via the internal link 21 
(step 2101).. The CA(m) that receives the internal write 
request determines from the directory information 362 

25 whether the cache memory 32 has a region that can store 
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data (hereinafter referred to as "write data") 
corresponding to the write request (step 2104). 

If there is no region capable of storing, the CA(m) 
decides, based upon the LRU algorithm or the like, which 
data corresponding to which LDEV and LDEV address in the 
cache memory 32 is written in the disk drive 4, then, 
writes the data in the disk drive 4 and thereafter, 
invalidates the corresponding region to thereby secure the 
region that can store the write data. Further, the CA(m) 
informs the CA(b) of the LDEV number and LDEV address of 
the invalidated data. 

The CA(b) that is informed invalidates the 
corresponding data to secure the region that can store the 
write data. After that, the CA(b) informs the CA(m) that 
the region is secured. It is to be noted that, since the 
cache memory 32 of the CA(b) has stored the data same as 
that in the CA(m) (the address may be different) as to the 
write data, the determine result performed at the step 2104 
can be applied as it is to the determine for the storage 
region in the CA(b) (step 2105). 

After the step 2105 or in a case where the step 2104 
determines that there is a storage region, the CA(m) 
transmits to the CA(b) an internal backup write request 
that is an internal message corresponding to the internal 
write request received at the step 2105. When each of the 
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from both of the CA(m) and CA(b) transmits a write complete 
response to the host computer 6 via the channel 61 (step 
2113) . 

Explained herelnbelow are two methods for a region 
allocation of the cache memory 32 , taking an operation 
among four cache adapters 3 as an example. For simplifying 
the explanation, a write cache region that stores the write 
data is only illustrated hereinbelow to be explained among 
storage regions possessed by each cache memory 32. However, 
the cache memory 32 has actually a read cache region too 
for storing the read data. Further, each cache adapter is 
referred to as CA1, 2, 3 and CA4 for distinguishing the 
cache adapters 3. 

Fig. 6 is a view showing the allocation of the storage 
region in the cache memory among the cache adapters 3 in a 
normal state where a failure does not occur. In this 
figure, the CA1 and CA2 as well as the CA3 and CA4 become 
pairs connected to the common disk drive 4 for ensuring the 
redundancy (this relationship is referred to as "cache 
adapter pair"). The cache memory 32 in the CA1 has a write 
cache region CA1(M) 30121 having stored therein data that 
the CA1 should execute the staging operation and destaging 
operation as the master cache adapter and a write cache 
region CA2(B) 30122 having stored therein a copy of the 
write data as the backup cache adapter of the CA2 . 
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Hereinbelow, the storage region where the data handled 
by the master cache adapter is called a master region and 
represented by (M) , while the storage region where the data 
(copied data) handled by the backup cache adapter is called 
a backup region and represented by (B) . 

The cache memory 32 of the CA2 that is a cache adapter 
pair of the CA1 includes a write cache region CA2(M) 30221 
and a write cache region CA1(B) 30222. Accordingly, the 
data included in the write cache region CA1(M) 30121 is the 
same as the data included in the write cache region 
CA1(B)30222 (the address and the order may be different). 

Specifically, in this figure, two cache adapters 3 
that form the cache adapter pair provide the storage region 
for storing the copy of the each other's data to the cache 
memory 32 to thereby make the write data of the cache 
adapter pair redundant. Arrows in this figure represent 
the relationship among the abovementioned copy of the data. 
Like the cache adapter pair of the CA1 and CA2 , the cache 
adapter pair of the CA3 and CA4 makes redundant the data 
stored in both cache memories 32. 

Fig. 7 is a view showing the allocation of the storage 
region in the cache memory in a case where the cache memory 
32 possessed by the CA2 becomes unusable due to a cause 
that a failure occurs therein or the cache memory is 
eliminated therefrom. Two power supply units A511 and B512 
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respectively supply electric power to the CA1, CA3 and CA2 , 
CA4 so that the operation can be continued even if the 
failure occurs in one of the power supply units . When the 
cache memory 32 possessed by the CA2 becomes unusable, the 
5 CA1 handles as the master region of the CA2 the copy of the 
master region of the CA2 possessed by the CA1, i.e., the 
CA2(B). Specifically, the CA1 operates also as the master 
cache adapter as for the write cache region CA2(M) 30221. 

On the other hand, the write cache region CA1(B) 30222 

10 and the write cache region CA2(B) 30122 that are the backup 
regions arranged in the CA2 and the CA1 are arranged in the 
CA4. Specifically, a copy of the data stored in the cache 
memory 32 in the CA1 is stored in the cache memory 32 in 
the CA4 . This makes the data stored in the cache adapter 3 

15 redundant by the cache adapter 3 other than the cache 

adapter pair even if one cache adapter 3 becomes unusable. 

The copy of the data of the CA1 can be arranged in the 
CA3 instead of the CA4, but electric power is supplied to 
either one of the master region and the backup region by 

20 arranging the copy of the data in the CA4 to which electric 
power is supplied from the power supply unit B512 that is 
different from the power supply unit A511 supplying 
electric power to the CA1 even if the failure further 
occurs in either one of the power supply units, thereby 

25 being free from losing the write data. 
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It is to be noted that, in a case where the capacity 
of the cache memory 32 in the CA4 is the same as that in 
the CA1, all data possessed by the CA1 cannot be stored in 
the cache memory 3042 in the CA4 (a normal use of the CA4 
5 becomes impossible), so that the CA1 destages half of the 
data stored in the cache memory 32 to the disk drive 4 and 
then , arranges the copy of the remaining data in the CA4 . 
The data amount to be destaged may not particularly be a 
half, but a half amount is desirable in view of the cache 

10 hit ratio of the cache memories 32 in the CA1 and the CA4 . 
Fig. 8 is a view showing another example of the 
allocation of the storage region of the cache memory in the 
state same as that shown in Fig. 7. Fig. 8 is the same as 
Fig. 7 in the data storing method to the CA1 , but is 

15 different from Fig. 7 in the arrangement of the copy of the 
data stored in the CA1 . Specifically, the write cache 
region CA1(B) 30222 and the write cache region CA2(B) 30122 
that are the backup regions arranged in the CA2 and the CA1 
before the CA2 becomes unusable are arranged in the CA4 and 

20 CA3 respectively. 

In this case, sufficient data amount destaged by the 
CA1 is, unlike Fig. 7, a third of the original storage 
region. This is because the region occupied by the copy of 
the data of the CA1 in the individual cache adapter 3 is 

25 decreased since, unlike Fig. 7, the copy of the data stored 
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in the CA1 is dispersed to be arranged in the cache 
memories of the CA4 and CA3 . This enables to make the 
write data redundant by the cache memory 32 even if one 
cache memory 32 becomes unusable, thereby enlarging the 
write cache region compared to the case of Fig. 7. 

Fig. 11 is a view showing another example of the 
allocation of the storage region of the cache memory. In 
this example, new cache adapter pair composed of a CAS and 
CA6 is newly added, whereby the size of the write cache 
region per one cache adapter 3 can be enlarged compared to 
the other cases even with the backup data of the CA1 stored 
therein when some cache adapter 3 becomes unusable. Fig. 
11 shows the arrangement example of the region allocation 
of the cache memory in a case where a backup region is 
placed in four cache adapters 3. However, there is no 
limitation on the number of the cache adapter to which the 
backup region is allocated. 

Fig. 9(A) is a view showing one example of the master 
management table 71 possessed by the manager adapter 7. 
The master management table 71 includes a cache adapter 
corresponding table 711 and a cache adapter pair table 712. 
The cache adapter corresponding table 711 has plural 
entries. Each entry has fields 7111, 7112 and 7113 having 
registered therein the LDEV number and information about 
the master cache adapter and backup cache adapter. 
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It is to be noted that registered in the master 
management table 71 is information about the LDEV of the 
entire disk controller 5. On the other hand, registered in 
the management table 31 possessed by the cache adapter 3 is 
5 only information about the LDEV that should be processed by 
its cache adapter 3 as the master cache adapter and backup 
cache adapter. 

The cache adapter pair table 712 has plural entries. 
Each entry includes fields 7121 and 7122 having registered 

10 therein information about the cache adapter pair and 
failure support cache adapter. 

The failure support cache adapter is a cache adapter 3 
that is in charge of storing the backup region in a case 
where a failure occurs in the corresponding cache adapter 3 

15 that is one of the cache adapter pair in the same entry. 
For example, the backup regions of the CA1 and CA2 are 
provided in the CA3 and CA4 in a case where the failure 
occurs in the CA2 in the first entry of the cache adapter 
pair table 712 of Fig. 9(A), which consequently shows that 

20 the arrangement of the storage region of the cache memory 
becomes the one shown in Fig. 8. 

Fig. 9(B) shows the cache adapter corresponding table 
711 having its content changed based upon the cache adapter 
pair table 712 in a case where the failure occurs in the 

25 CA2 from the state shown in Fig. 9(A). A half-tone dot 
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meshing sections in the table are the changed sections . In 
the field 7112 in which the information of the master cache 
adapter is registered, the information of the field having 
registered therein the information for designating the CA2 
5 that is the cache adapter 3 having the failure occurring is 
changed to the CA1 that is the cache adapter pair, and in 
the field 7113 in which the information of the backup cache 
adapter is registered, the sections in which the 
information for designating the CA2 and CA1 are changed to 
10 the CA3 and CA4 according to the information of the failure 
support cache adapter registered in the cache adapter pair 
table 712. 

Fig. 10 is a view showing a procedure of the manager 
adapter 7 (hereinafter referred to as "MA") in a case where 

15 the failure occurs in the CA2 . 

Firstly, the MA recognizes the occurrence of the 
failure in the CA2 . Specifically, the MA recognizes the 
occurrence of the failure in the cache memory 32 in the CA2 
by the report of the occurrence of the failure in the cache 

20 memory 3022 in the CA1 , the report from the host adapter 1 
that transmits the internal message to the CA2 but cannot 
obtain the response due to the failure, or the periodical 
check of the total disk controller 5 by the MA or the 
direction from the administrator via a management interface 

25 (not shown) connected to the MA (step 2201). 
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The MA recognizing the failure of the CA2 confirms by 
the cache adapter pair table 712 of the master management 
table 71 that the cache adapter pair of the CA2 is the CA1 
(step 2202), and scans the field 7112 of the cache adapter 
5 corresponding table 711 to thereby change the section 

designating the CA2 as the master cache adapter to the CA1 
that is the cache adapter pair (step 2203). 

Subsequently, the MA confirms from the cache adapter 
pair table 712 that the failure support cache adapter of 

10 the cache adapter pair (CA1, CA2) is (CA3 , CA4) (step 2204). 
Then, the MA scans the field 7113 of the cache adapter 
corresponding table 711 to change the content of the 
section designating the CA1 and CA2 to the CA3 or the CA4 
that is the failure support cache adapter (step 2205). 

15 Thereafter, the MA transmits the information in the 

entry whose content is changed in the cache adapter 
corresponding table 711 to the host adapter 1 and the cache 
adapter 3 registered in the fields 7112 and 7113 that are 
before the change or after the change of the changed entry 

20 (step 2206) . 

The host adapter 1 or the cache adapter 3 receiving 
this transmitted information reflects the transmitted 
information to the management table 11 or the management 
table 31 possessed by itself. Further, the cache adapter 3 

25 calculates the allocation of the write cache region of the 
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cache memory 3 (step 2207). It is to be noted that the 
processing at the step 2207 will later be described in 
detail. As described above, the MA detects the failure of 
the cache adapter 3 , changes the setting of the backup 
5 cache adapter and transmits the changed information to the 
entire system. By this, the host adapter 1 can change the 
access to the LU that uses the CA2 as the master cache 
adapter to the CA1 and the CA1 can take over the staging 
operation and destaging operation to the disk drive 4 that 

10 is handled by the CA2 . 

Fig. 12 is a view showing a procedure in the cache 
adapter 3 that receives the information transmitted from 
the MA when the failure occurs in the CA2 . 

Firstly, the CA1 that is the cache adapter pair of the 

15 CA2 recognizes the occurrence of the failure from the 

report from the CA2 or the host adapter 1 transmitting the 
internal message, or the investigation of the MA or the 
direction from the administrator via the MA (step 2301). 
The CA1 recognizing the occurrence of the failure 

20 stops receiving the access request to the LDEV managed by 
the CA1 and the CA2 , and writes the data stored in the 
write cache region in the cache memory 3012 in the disk 
drive 4 (step 2302). Thereafter, it receives from the MA a 
part of the information (only the section relating to the 

25 CA1) of the entry in the renewed cache adapter 
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corresponding table 711 and reflects it to the management 
table 31 (step 2303) . 

The CA1 calculates the allocation of the write cache 
region of the cache memory 3 based upon the reflected 
5 content. Specifically, if the number of the failure 

support cache adapter registered in the entry 7122 is m, 
the write cache region that can be used for one cache 
adapter 3 other than the region used for the data backup 
becomes m/(2m+2) of all the write cache regions (step 2304). 

10 The CA1 transmits a write cache region allocation 

request that is the internal message to the failure support 
cache adapter that is the backup cache adapter, here the 
CA3 and CA4 (step 2305). The CA3 and CA4 receiving the 
write cache region allocation request writes the data in 

15 the cache memory 3 to the disk drive 4 until the requested 
write cache region for performing the backup of the data in 
the CA1 and the CA2 can be secured (step 2306). 

The CA3 and the CA4 that secure the write cache region 
requested by the CA1 transmits a write cache region 

20 allocation response that is the internal message to the CA1 
(step 2307). The CA1 confirms the write cache region 
allocation responses from all cache adapters 3 that has 
transmitted the write cache region allocation request, and 
restarts the receipt of the access request to the LDEV that 

25 is stopped at the step 2302 (step 2308). 



As a result, the data written in the cache memory in 
the CA1 is backed up by the CA3 or the CA4 . thereby 
ensuring the redundancy. It is to be noted that, although 
the write of data is performed in the aforesaid procedure, 
5 the subject that the write data is backed up is not the CA2 
but the CA3 or the CA4. 

Subsequently explained is a process in a case where 
the cache memory 32 in the cache adapter 3 having the 
failure occurring is recovered. Fig. 13 is a view showing 
10 a procedure when the CA2 is recovered from the failure 
occurring therein. 

The MA recognizes the recovery of the CA2 from the 
direction from the administrator or the like (step 2401), 
changes the cache adapter corresponding table 711 to the 
15 state before the failure occurs in the CA2 and transmits 
the changed information to each host adapter 1 and the 
cache adapter 3 by the internal message. It is to be noted 
that the state of the cache adapter corresponding table 711 
before the occurrence of the failure is stored in a memory 
20 possessed by the MA. Further, the MA informs the CA1 that 
is the cache adapter pair of the CA2 of the recovery from 
the failure of the CA2 by using the internal message (step 
2402) . 

The informed CA1 writes all data stored in the write 
25 cache region in the CA1 to the disk drive 4 for 
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invalidating the data in the write cache region (step 2403). 
and then, the CA1 transmits a write cache region open 
request that is the internal message to the CA3 and the CA4 
that are the failure support cache adapters that have 
performed the operation of the backup cache adapter (step 
2404) . 

The CA3 and CA4 receiving the write cache region open 
request opens the write cache region corresponding to the 
backup region of the CA1 and the CA2, changes the write 
cache region to the master region and backup region 
respectively of the CA3 and the CA4 before the occurrence 
of the failure and transmits to the CA1 a write cache 
region open response. It is to be noted that the 
information of the master region and the backup region of 
the CA3 and the CA4 before the occurrence of the failure is 
stored in the MA, and the CA3 and CA4 communicate with the 
MA by using the internal message to thereby obtain the 
information for changing the configuration of the cache 
memory 32 (step 2405). 

The CA1 confirming the write cache region open 
response from the CA3 and the CA4 transmits an operation 
start request that is the internal message to the CA2 for 
starting the receipt of the access request to the LDEV that 
is handled by the CA1 (step 2406). The CA2 receiving the 
operation start request starts the receipt of the access 
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request to the LDEV in which the CA2 becomes the master 
cache adapter and the processing as the backup cache 
adapter (step 2407). 

Subsequently explained is a processing for recovering 
from the failure without losing the write data even in a 
case where the failure occurs in the cache memories 32 in 
both cache adapters composing the cache adapter pair. 

Fig. 14 is a view showing a procedure from the 
occurrence of the failure in the CA1 and the CA2 that form 
the cache adapter pair to the recovery from the failure by 
exchanging the cache adapter 3. In this embodiment, the 
explanation is made supposing that, with the state where 
the failure occurs in either one of the CA1 and the CA2 and 
the data that should be stored in the cache memories in the 
CA1 and the CA2 is backed up in the CA3 and the CA4 , the 
failure occurs in the remaining CA. 

Firstly, the failure occurs in both of the CA1 and the 
CA2 that form the cache adapter pair (step 2501). 

The MA recognizes the occurrence of the failure in the 
cache memories 32 in both CA1 and the CA2 that form the 
cache adapter pair from the report from the CA1 , CA2 or the 
host adapter 1, the investigation from the MA or the 
direction from the administrator (step 2502). 

In this case, the MA transmits to the host adapter 1 
an unusable request of the LDEV in which the CA1 and the 
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CA2 are the master cache adapters. The host adapter 1 
receiving the unusable request returns an error showing 
non-accessible to the host computer 6 that requests the 
access to the corresponding LDEV (step 2503), The CA1 and 
5 the CA2 are exchanged to the new cache adapters 3 from this 
state by a maintenance engineer, whereupon the disk drive 4 
is connected to the disk channel 41 as it was and a setting 
is made so that the new cache adapters 3 are recognized as 
the CA1 and the CA2 from the entire disk controller 5, 

10 whereby the cache memories 32 of the CA1 and the CA2 are 
recovered from the failure (step 2504). 

The CA1 and CA2 recovered from the failure transmit a 
write data transmitting request that is the internal 
message to the CA3 and the CA4 that are the failure support 

15 cache adapters and have performed the backup cache adapter 
operation (step 2505). The CA3 and the CA4 receiving the 
write data transmitting request transmit the data stored in 
the write cache region corresponding to the backup region 
of the CA1 and CA2 respectively. After transmitting all 

20 data stored in the corresponding write cache region, the 
CA3 and the CA4 release the corresponding write cache 
region to change the write cache region to the master 
region and backup region of the respective CA3 and CA4 that 
are before the failure occurs in the CA1 and the CA2 (step 

25 2506). 
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The CA1 and the CA2 write the write data received from 
the CA3 or the CA4 to the disk drive 4 one after another, 
and after all write data is processed, they start to 
receive the access request to the LDEV handled by each CA 
(step 2507) . 

Explained so far is about the process for allocating 
the storage region of the cache adapter 3 under the 
leadership of the MA. However, it is possible to execute 
the abovementioned process only by each host adapter 1 and 
each cache adapter 3 . 

Fig. 15 is a view showing an embodiment of a process 
for allocating the storage region of the cache adapter 3 
wherein the MA is not used in a case where the failure 
occurs in the cache memory 32 in the CA2 . In this case, 
each host adapter 1 and each cache adapter 3 has in its own 
management table 11 and the management table 31 the cache 
adapter corresponding table 711 and the cache adapter pair 
table 712 corresponding to the cache adapter 3 to which 
each one can get access. 

Moreover, the explanation so far describes the process 
method for firstly writing to the disk drive 4 all write 
data stored in the cache memory 32 in the other cache 
adapter 3 of the cache adapter pair when the failure occurs 
in the cache memory 32. but a method may be applied wherein 
all write data is not written in the disk drive 4. Here, 
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the former method is called an all destaging method, while 
the latter one is called a copying method, and in this 
embodiment, processes in both methods will be explained. 
It is to be noted that the copying method can be 
5 executed in the process explained so far via the MA. Which 
method is selected is determined depending upon the balance 
between the process time and the reliability. The 
procedure shown in Fig. 16 will be explained hereinbelow. 
The host adapter 1 (hereinafter referred to as "HA1" ) 

10 recognizes the failure in the cache memory 32 in the CA2 
from the internal read request , the response to the 
internal write request or the like, and informs all other 
HAls of the failure of the CA2 by the internal message 
(step 2601). The informed all HAls confirm that the cache 

15 adapter pair of the CA2 is the CA1 and its failure support 
cache adapter based upon the cache adapter pair table 712 
possessed by each HA1, and change the CA2 in the field 1122 
in the management table 11 to the CA1 as well as the CA2 or 
the CA1 in the field 1123 to its failure support cache 

20 adapter (step 2602). Further, the HA1 finding the failure 
informs the CA1 of the failure in the CA2 by the internal 
message (step 2603). 

The informed CA1 performs the following process 
according to the set method. In the case of the all 

25 destaging method, the CA1 writes to the disk drive 4 all 
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data stored in the write cache region for invalidating this 
write cache region. Subsequently, it calculates the 
allocation of the write cache region based upon the number 
of the failure support cache adapter to thereby change the 
5 management table 31. On the other hand, in the case of the 
copying method, the CA1 calculates the allocation of the 
write cache region without invalidating all write cache 
region, and then, writes to the disk drive 4 the write data 
as much as the requested write cache region can be secured, 
10 thereby invalidating the corresponding write cache region 
(step 2604) . 

Subsequently, the CA1 transmits to the failure support 
cache adapter that becomes the backup cache adapter the 
write cache region allocation request by an amount obtained 

15 by the calculation with the internal message (step 2605). 

The failure support cache adapter receiving the write cache 
region allocation request writes the write data in the 
cache memory 3 to the disk drive 4 as much as the requested 
write cache region can be secured, thereby invalidating the 

20 cache region (step 2606). 

It is to be noted that, in the copying method, the CA1 
transmits here the data stored in the write cache region to 
the failure support cache adapter. In a case where the 
transmission is completed or in the case of the all 

25 destaging method, the receipt of the access request to the 
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corresponding LDEV is started in all cache adapters (step 
2607) . 

As described above, in the present invention, a pair 
of a controller (cache adapter) provided with a cache 
memory having the same disk drive is connected via a 
network, thereby establishing a large-scale configuration 
of a storage system, and between the pair of the controller, 
stored in the cache memory of one controller is a copy of 
write data in the cache memory in the other controller, 
thereby making each controller redundant to enhance 
reliability. 

Further, when a failure occurs in the cache memory, 
the controller in which the failure occurs and the 
controller that uses jointly the disk drive with the other 
controller perform a staging and destaging, while the other 
normally operating controller is caused to perform only the 
operation of making the write data redundant, whereby the 
write access response speed and reliability before the 
occurrence of the failure are maintained. 

According to the present invention, reliability is 
enhanced in a storage system having a cache memory. 
Further, a write access response speed and reliability 
before the occurrence of the failure are maintained in the 
storage system having the cache memory. 



